Using Supertag in MUC-7 Template Relation Task

نویسندگان

  • Libin Shen
  • Jinying Chen
چکیده

The Template Relation (TR) task is an information extraction problem introduced in the 7th Message Understanding Conference (MUC-7). In this paper, we have proposed an approach to convert this problem into a discriminative one. We obtain F-Measure of 78% on sentence-level relation which is comparable to the best system presented in MUC-7, while almost no extra annotation work is required. In our approach, we first use Supertagger [Joshi 1994] and Lightweight Dependency Analyzer [Srinivas 1997] to do shallow parsing on the text. Then features are abstracted from the shallow parsing result. We use these features as weak hypotheses, and combine them into the final one with the boosting algorithm [Schapire 2000].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Description of the Oki System as Used for MUC-7

This paper describes the Oki Information Extraction system as used for MUC-7 evaluation [1][2]. The tasks we have conducted are Named Entity, Co-reference, Template Element and Template Relation. Each module is implemented using MT system modules and pattern recognition modules. Our purposes to participate MUC-7 evaluation are to evaluate howMT system modules are e ective for other application ...

متن کامل

TASC: Description of the TASC System Used for MUC-7

TASC has recently developed a technology for learning scenario-templateextraction grammars from examples provided by end-users with no special computational or linguistic knowledge. For straightforward scenario-template problems, complete extraction systems can be learned from scratch in a few hours. The learned systems are fast, robust, and as accurate as carefully handcrafted systems. For mor...

متن کامل

Appendix D: MUC-7 Information Extraction Task Definition (version 5.1)

Information extraction in the sense of the Message Understanding Conferences has been traditionally defined as the extraction of information from a text in the form of text strings and processed text strings which are placed into slots labeled to indicate the kind of information that can fill them. So, for example, a slot labeled NAME would contain a name string taken directly out of the text o...

متن کامل

MUC-3 evaluation metrics

The MUC-3 evaluation metrics are measures of performance for the MUC3 template fill task. Obtaining summary measures of performance necessitates the los s of information about many details of performance . The utility of summary measures for comparison of performance over time and across systems should outweigh thi s loss of detail . The template fill task is complex because of the varying natu...

متن کامل

SRA: description of the SRA system as used for MUC-6

INTRODUCTIO N SRA used the combination of two systems for the MUC–6 tasks : NameTag"" , a commercial software product that recognizes proper names and other key phrases in text ; and HASTEN, an experimental text extraction system that has been under development for only one year . For the Named Entity task, SRA adapted a subset of NameTag ' s capabilities to the MUC–6 specification . For the Te...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007